# Hate speech detection

Metahatebert
Apache-2.0
MetaHateBERT is a text classification model based on the BERT architecture, specifically designed for detecting hate speech.
Text Classification Transformers English
irlab-udc
1,456
3
Visobert
ViSoBERT is the first monolingual pre-trained language model designed specifically for Vietnamese social media text. It is based on the XLM-R architecture and performs strongly on a range of Vietnamese social media tasks.
Large Language Model Transformers Other
uitnlp
2,260
35
Arabic Xlm Xnli
MIT
A model based on XLM-RoBERTa-base, with continued pre-training on an Arabic Twitter corpus and fine-tuning on the Arabic XNLI dataset for zero-shot text classification.
Text Classification Transformers Arabic
morit
268
0
Chinese Xlm Xnli
MIT
A model based on XLM-RoBERTa-base, with continued pre-training on a large multilingual Twitter corpus and fine-tuning on the Chinese XNLI dataset for zero-shot text classification in Chinese.
Large Language Model Transformers Chinese
morit
19
22
French Xlm Xnli
MIT
A model based on XLM-RoBERTa-base, with continued pre-training on a multilingual Twitter corpus and fine-tuning on the French XNLI dataset for zero-shot text classification.
Text Classification Transformers French
morit
686
2
Bert Hateful Memes Expanded
Apache-2.0
A model fine-tuned from bert-base-uncased to identify hateful text content in memes.
Text Classification Transformers
limjiayi
29
4
Robertuito Base Uncased
RoBERTuito is a language model pre-trained on 500 million tweets specifically for Spanish social media text. It performs strongly on a range of Spanish social media tasks.
Large Language Model Transformers Spanish
pysentimiento
1,451
12
Hate Trained 31415
Apache-2.0
A hate speech detection model fine-tuned from distilbert-base-uncased on the tweet_eval dataset.
Text Classification Transformers
marcolatella
15
0
Autonlp Text Hateful Memes 36789092
This is a binary classification model trained via AutoNLP for detecting hate speech content in text.
Text Classification Transformers English
am4nsolanki
25
3
Hate Trained 1234567
Apache-2.0
A hate speech detection model fine-tuned from distilbert-base-uncased on the tweet_eval dataset.
Text Classification Transformers
marcolatella
15
0
Distilbert Base Uncased Hate Speech Offensive Train 16 4
Apache-2.0
A lightweight model based on DistilBERT for detecting hate speech and offensive content in text.
Text Classification Transformers
SetFit
17
0
Distilbert Base Uncased Hate Speech Offensive Train 16 2
Apache-2.0
A lightweight model based on DistilBERT for detecting hate speech and offensive content in text.
Text Classification Transformers
SetFit
18
0
Distilbert Base Uncased Hate Speech Offensive Train 8 9
Apache-2.0
A hate speech detection model fine-tuned from distilbert-base-uncased.
Text Classification Transformers
SetFit
17
0
Distilbert Base Uncased Hate Speech Offensive Train 8 3
Apache-2.0
This model is a fine-tuned version of distilbert-base-uncased, designed for classifying hate speech and offensive content.
Text Classification Transformers
SetFit
17
0
Distilbert Base Uncased Hate Speech Offensive Train 16 8
Apache-2.0
A lightweight model based on DistilBERT, fine-tuned for hate speech and offensive language detection.
Text Classification Transformers
SetFit
15
0
Dehatebert Mono Indonesian
This model detects hate speech in Indonesian. It is fine-tuned from multilingual BERT using only Indonesian data.
Text Classification
Hate-speech-CNERG
186
4
Dehatebert Mono Portugese
Apache-2.0
This model detects hate speech in Portuguese. It is fine-tuned from a multilingual BERT model in a monolingual setting.
Text Classification Other
Hate-speech-CNERG
41
4
Bert Base Uncased Hatexplain Rationale Two
Apache-2.0
A BERT-based text classification model for detecting hate speech and offensive content, with rationale prediction capability.
Text Classification Transformers English
Hate-speech-CNERG
523
12
Distilbert Base Uncased Hate Speech Offensive Train 32 6
Apache-2.0
A lightweight model based on DistilBERT, fine-tuned for hate speech and offensive language detection.
Text Classification Transformers
SetFit
17
0
Distilbert Base Uncased Hate Speech Offensive Train 16 0
Apache-2.0
A lightweight model based on DistilBERT for detecting hate speech and offensive content in text.
Text Classification Transformers
SetFit
26
0
Distilbert Base Uncased Hate Speech Offensive Train 8 4
Apache-2.0
A hate speech and offensive language detection model fine-tuned from distilbert-base-uncased.
Text Classification Transformers
SetFit
15
0
Bert Base Uncased Hatexplain
Apache-2.0
HateXplain is a text classification model that labels text as hate speech, offensive, or normal. It is trained on data from Gab and Twitter, and uses human-annotated rationales to improve performance.
Text Classification English
Hate-speech-CNERG
3,831
21
© 2025 AIbase